REMARKS 



Status of the Claims 

Claims 26, 30 and 31 are pending in the application. Claims 26, 30 
and 31 are rejected. Claim 31 is canceled and claims 26 and 30 are amended 
herein. No new matter is added to the amended claims. 

Claim amendments 

Claim 31 is canceled and its limitation incorporated in Claim 26 to 
overcome the rejection under 35 U.S.C. §1 12, first paragraph. Amended claim 26 
is drawn to a method of producing activated T cells directed towards stratum 
corneum chymotrytic enzyme (SCCE) in an individual with ovarian cancer, 
prostate cancer or breast cancer. Such a method comprises the steps of exposing 
dendritic cells to a human stratum corneum chymotrytic enzyme peptide selected 
from the group consisting of SEQ ID NOs: 31, 32, 33, 34, 35, 36, 80, 86, 99 or the 
human stratum corneum chymotrytic enzyme protein encoded by the DNA of SEQ 
ID NO: 30, thereby producing activated dendritic cells. The activated dendritic 
cells are then exposed to T cells, where the activated dendritic cells would 
present the stratum corneum chymotrytic enzyme peptide to the T cells, thereby 
producing activated T cells directed toward the stratum corneum chymotrytic 
enzyme in the individual with ovarian cancer, prostate cancer or breast cancer. 

Claim 30 is amended to properly depend from amended claim 26. 
Amended claim 30 recites the step where the dendritic cells are isolated from the 



individual prior to the exposure to the stratum corneum chymotrytic enzyme 
polypeptide, where the activated dendritic cells are reintroduced into the individual 
subsequent to the exposure. 

The 35 U.S.C. §112. First Paragraph Rejection 

Claims 26 and 30-31 remain rejected under 35 U.S.C. §112, first 
paragraph for lack of enablement. Applicant respectfully traverses this rejection. 

The Examiner states that the Declarations filed on 2/19/2003, 
1/23/2006 and 9/1/2006 are insufficient to overcome the rejection of claims 26, 30 
and 31 based upon the insufficiency of the disclosure under 35 U.S.C. §112, first 
paragraph because the showing is not commensurate in scope with the claims. 
Additionally, the Examiner states that although the Declarations submitted provide 
evidence that stratum corneum chymotrytic enzyme peptides with SEQ ID NOs: 
32 and 33 can induce CTL response in vitro, the claims still encompass stratum 
corneum chymotrytic enzyme peptides other than SEQ ID NOs: 32 and 33 that do 
not contain amino acids 5-13 or 123-131 of SCCE as well as the treatment of just 
any disorder, particularly ovarian, prostate, breast and colon cancer as recited in 
claim 31. 

Furthermore, the Examiner states that although the instant claims 
are drawn to stratum corneum chymotrytic enzyme dendritic cell immunotherapy 
in cancer patients, the Declarations are limited to in vitro evidence and nucleic 
acid expression rather than polypeptide expression. The Examiner then cites 
teachings of Fu et al, Powell et al, Vallejo et al and Lewin B to state that 
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analysis of levels of polynucleotide transcripts cannot be relied upon to anticipate 
levels of protein expression and one of skill in the art cannot anticipate that the 
level of a specific mRNA expressed by a cell will be paralleled at the protein level 
due to complex homeostatic factors controlling translation and post-translational 
modification. 

With regards to the references submitted by the Applicant in the last 
response, the Examiner states that with exception to the art of Noriaki ef al, 
Higuchi ef a/ and Van Gluck et al, the other references are limited to in vitro 
stimulation of dendritic cells and provide a starting point for further 
experimentation into their use for patient specific immunotherapy. While pointing 
out to the fact that Higuchi et al and Van Gluck et al were published after the 
filing date of the present application, the Examiner states that the claims still 
encompass stratum corneum chymotrytic enzyme peptides other than SEQ ID 
Nos: 32 and 33 that do not contain amino acids 5-13 or 123-131 of stratum 
corneum chymotrytic enzyme and there is no evidence that the stratum corneum 
chymotrytic enzyme signal peptide is part of the mature stratum corneum 
chymotrytic enzyme protein expressed in cancer patients such that the stratum 
corneum chymotrytic enzyme activated T cells targeting this sequence effectively 
inhibit tumor growth in a patient. 

The Examiner further reiterates that in view of unpredictability in the 
art to which the invention pertains as evidenced by Cranmer ef al, Soruri et al, 
Wang ef al and Geysen et al, the lack of established clinical protocols for 
effective dendritic cell therapies, undue experimentation would be required to 
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practice the claimed methods with a reasonable expectation of success, absent a 
specific and detailed description in the instant specification of how to effectively 
practice the claimed methods and absent working examples providing evidence 
which is reasonably predictive that the claimed methods of dendritic cell 
immunotherapy to treat ovarian and prostate cancer patients in particular, 
commensurate in scope with the claimed invention. For these reasons, the 
Examiner has maintained rejection of claims 26 and 30-31 for lack of enablement. 

Claim 26 is amended as discussed supra and is drawn to a method 
of producing activated T cells directed towards stratum corneum chymotrytic 
enzyme in an individual with ovarian cancer, prostate cancer or breast cancer. 
The inclusion of breast cancer is supported by the teaching in the art (Fig. 6, 
Harvey et al., 2000, J Biol Chem, 275(48): 37397-37406) and in the Declaration 
included with the response filed August 4, 2006, both of which disclose 
expression of stratum corneum chymotrytic enzyme in breast cancer. 

With regard to the Examiner's contention that there was no evidence 
of induction of CTL response provided for peptides other than the ones with SEQ 
ID NOs: 32 and 33, Applicant would like to respectfullly point out the patentability 
rules, which state that "the specification need not contain an example if the 
invention is otherwise disclosed in such a manner that one skilled in the art will be 
able to practice it without undue experimentation. Lack of working example, 
however, is a factor to be considered, especially in a case involving an 
unpredictable and undeveloped art. But because an enabling disclosure is 
required, Applicant need not describe all actual embodiments" (M.P.E.P. 
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2164.02). Since the Declarations enable the claimed method for peptides with 
SEQ ID Nos: 32 and 33, Applicant submits that a person having ordinary skill in 
this art will be able to practice the claimed method using other peptides without 
undue experimentation. 

In response to the Examiner's statement that analysis of levels of 
polynucleotide transcripts should not be relied upon to anticipate levels of protein 
expression, Applicant states that all the techniques mentioned herein are well 
known and acceptable in the art and are routinely used by investigators to 
examine the expression of genes in several disorders. Given this, one skilled in 
the art can easily examine expression of proteins in these cancers using 
techniques that are routine in the art, for instance, western blots, 
immunohistochemical analysis, ELISA etc. 

Further, in response to Examiner's statement that the evidence 
provided by the Applicant with regards to dendritic cell immunotherapy were mostly 
limited to in vitro experiments, Applicant would like to respectfully point out that in 
vitro studies are accepted by those with ordinary skill in the art of gene therapy and 
anticancer therapy as being predictive of success in vivo. Furthermore, the 
Examiner has not cited any case law stating that the 35 U.S.C. §112 requires in vivo 
or human clinical data for Applicant's method to satisfy the enablement requirement 
Nevertheless, as stated by the Examiner, Noriaki et al demonstrated efficacy of 
using dendritic cell immunotherapy in patients. Thus, dendritic cell immunotherapy 
is not as unpredictable as stated by the Examiner. 
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Additionally, with regard to the issue of whether stratum corneum 
chymotrytic enzyme signal peptide is expressed in cancer patients, Applicant states 
that it is not necessary to show that the signal peptide is expressed in cancer 
patients. The most relevant demonstration is of the immunogenic potential of the 
signal peptide. The Declaration provided with the response filed January 20, 2006 
demonstrated that the peptide with SEQ ID NO: 33 that is derived from a signal 
peptide sequence was capable of inducing a CTL response to lyse CaOV3 ovarian 
tumor cells that were not pulsed with the peptide. This data provides a strong 
evidence that the peptide with SEQ ID NO: 33 may be naturally expressed on the 
ovarian tumor cells. Since the enablement requirement does not require that all the 
embodiments be described, Applicant contends that the in vitro data provided by the 
Applicant and the references provided by the Applicant demonstrating the efficacy of 
dendritic cell immunotherapy in vitro and in vivo are sufficient to enable the instantly 
claimed methods. 

Applicant reiterates as stated in In re Woody, 331 F.2d 636, 639, 
141 USPQ 518, 520 (CCPA 1964): "It appears that no one on earth is certain as 
of the present whether the process claimed will operate in the manner claimed. 
Yet absolute certainty is not required by the law. The mere fact that something 
has not previously been done clearly is not, in itself, a sufficient basis for 
rejecting all applications purporting to disclose how to do it." A novelty in 
the claimed invention relates in part to the identification of immunogenic stratum 
corneum chymotrytic enzyme peptides and not to the art of loading the dendritic 
cells with these peptides. Nevertheless, the induction of cytotoxic T cell response 
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by some of the claimed stratum corneum chymotrytic enzyme peptides is 
sufficient to demonstrate the success in loading of dendritic cells with these 
peptides, culturing the loaded dendritic cells and production of immune activated 
T cells. Therefore, a fair reading of the instant specification along with the 
information available in the art should enable one of skill in the art to make and 
use the claimed invention without undue experimentation. Thus, Applicant 
submits that the scope of the claimed invention is commensurate with the 
enablement provided. Based on the above-mentioned amendments and remarks, 
Applicant respectfully requests the withdrawal of rejection of claims 26 and 30-31 
under 35 U.S.C. §112, first paragraph. 

Claim 31 stands rejected under 35 U.S.C. §112, first paragraph as 
failing to comply with the written description requirement as introducing new 
matter. Applicant respectfully traverses this rejection. 

The Examiner states that the response filed by the Applicant on 
September 1, 2006 introduced new matter into the claims. As presently amended 
claim 31 is drawn to the claimed immunotherapeutic method of wherein the 
individual has breast or colon cancer. The Examiner further states that although 
the PTO has the initial burden of presenting evidence or reasons why persons 
skilled in the art would not recognize description of the invention defined by the 
claims in the disclosure, when filing an amendment the applicant should show 
support in the original disclosure for new or amended claims. Hence, the 
Examiner states that a sufficient written support in the specification or claims as 
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filed should be provided by the Applicant for the limitations recited in the presently 
amended claim 31 or these limitations must be removed from the claims in 
response to this Office Action. 

Claim 31 is canceled herein. Additionally, amended claim 26 is 
drawn to a method of producing activated T cells directed towards stratum 
corneum chymotrytic enzyme in an individual with ovarian, prostate or breast 
cancer. The limitation of "colon cancer" is not included in amended claim 26. 
Applicant submits that expression of stratum corneum chymotrytic enzyme in 
breast cancer was known in the art (Fig. 6, Harvey et al., 2000, J Biol Chem, 
275(48): 37397-37406) at the time the invention was filed and that one skilled in 
the art would easily recognize that the instantly claimed method would also be 
useful in inducing an immune response in breast cancer. Furthermore, the 
declaration provided by the Applicant along with the response filed August 4, 
2006 had data that was obtained prior to the filing of the instant invention 
(September 9, 1997) and further confirmed the expression of stratum corneum 
chymotrytic enzyme in breast cancer. Hence, Applicant contends that the 
inclusion of breast cancer in the amended claim 26 did not introduce any new 
concept. Accordingly, based on this amendment and remarks, Applicant 
respectfully request the withdrawal of rejection of claim 31 under 35 U.S.C. §1 12, 
first paragraph. 

This is intended to be a complete response to the Office Action 
mailed November 17, 2006. Applicant submit that the pending claims are in 
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condition for allowance. If any issues remain outstanding, please telephone the 
undersigned attorney of record for immediate resolution. 



Respectfully submitted, 




Registration No. 35,423 
Counsel for Applicant 



ADLER& ASSOCIATES 
8011 Candle Lane 
Houston, Texas 77071 
713-270-5391 
badler1@houston.rr.com 
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The tissue or glandular kallikreins (KLK) are mem- 
bers of a highly conserved multigene family encoding 
serine proteases that are central to many biological pro- 
cesses. The rodent KLK families are large, highly con- 
served and clustered at one locus. The human KLK gene 
family is clustered on chromosome 19ql3.3-13.4, and un- 
til recently consisted of just three members. However, 
recent studies have identified up to 11 new members of 
the KLK family that are less conserved than their rodent 
counterparts. Using a Southern blot and sequence anal- 
ysis of 10 BACs and cosmids spanning approximately 
400 kilobases (kb) either side of the original KLK 60-kb 
locus, we demonstrated that these genes also lie adja- 
cent to this. We have also clarified the position of several 
microsatellite markers in relation to the extended KT,ir 
locus. Moreover, from Southern blot analysis of the cos- 
mids and BACs with a degenerate oligonucleotide probe 
to the histidine-encoding region of serine proteases, we 
have shown that there are no other serine protease 
genes approximately 400 kb centromeric and 220 kb te- 
lomeric of the extended locus. We performed an exten- 
sive analysis of the expression patterns of these genes by 
poly(A) + RNA dot blot and reverse transcriptase-polym- 
erase chain reaction analysis, and demonstrated a di- 
verse pattern of expression* Of interest are clusters of 
genes with high prostate (KLK2-4) and pancreatic 
(KLK6-13) expression suggesting evolutionary conser- 
vation of elements conferring tissue specificity. From 
these findings, it is likely that the human KLK gene 
family consists of just 14 clustered genes within 300 kb 
and thus is of a comparable size to the rodent families 
(13-24 genes within 310 and 480 kb, respectively). In 
contrast to the rodent families, the newest members of 
the human KLK family are much less conserved in se- 
quence (23-44% at the protein level) and appear to con- 
sist of at least four subfamilies. In addition, like the rat, 
these genes are expressed at varying levels in a diverse 
range of tissues although they exhibit quite distinct pat- 
terns of expression. 
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Serine proteases are a group of protein-cleaving enzymes 
that contain a serine residue in their active site and play 
important roles in diverse physiological processes. The glandu- 
lar or tissue kallikreins -are a subgroup of serine proteases 
highly conserved across several species, that are involved in 
the post-translational processing of polypeptides to their bio- 
active or inactive forms, a function that underlies and is central 
to most biological events (1). Tissue kallikrein (encoded by 
KLK1), 1 from which this family derives its generic name, pro- 
cesses kinin ogen to release (lys)-bradykinin, a multifunctional 
peptide involved in the regulation of local blood flow, sodium 
balance, inflamm ation, and cell proliferation in many tissues 
(2). Other members of the KLK family include growth factor 
processing enzymes such as mouse epidermal growth factor- 
binding protein, y-nerve growth factor, and enzymes that gen- 
erate vasoactive peptides such as the mouse pro-renin convert- 
ing enzymes, y-renin, and rat tonin (3-5). In man, the two other 
most well characterized enzymes, prostate-specific antigen, 
PSA (encoded by KLK3; Ref. 6), and hK2 (encoded by KLK2; 
Ref. 7), act on entirely different substrates. Nevertheless, each 
is involved in the selective cleavage of specific precursors to 
their bioactive or inactive forms (8). These findings highlight 
the involvement of the KLK family in a number of different and 
complex enzymic pathways that underlie several important 
biological functions. 

The wide range of potential functions of these enzymes is 
emphasized by their diverse, yet specific, tissue expression 
patterns. Whereas KLK2 and KLK3 are primarily expressed at 
high levels in the secretory epithelium of the prostatic ducts 
and have been implicated in normal prostatic function as well 
as in cancer progression (8), tissue kallikrein, through its mul- 
tifunctional roles, is involved in the (patho)physiology of the 
kidney, brain, and respiratory, gastrointestinal, and reproduc- 
tive tracts (2). This diverse functionality is underscored by the 
ubiquitous expression pattern observed for KLK1 in the rat (9). 
Although a function has not yet been ascribed to the protein 
products of most other rat KLK genes, an indication of the 
tissue in which they will be functional is gained from their 
tissue expression patterns (9). In man, the newly identified 
KLK serine proteases, Protease M/neurosin/zyme (10-12) and 
normal epithelial cell-specific 1 (Nesl) 2 (13), are highly ex- 



1 Approved nomenclature by the Human Genome Nomenclature 
Committee for the human kallikrein family: KLK1, KLK2, KLK3, 
iCUC4:Prostase KLKL1; KLK5:KLKL2; KLK6:Trotease M; KLK7- 
SCCE; jKLK&Neuropsin; KLK9:KLKL3; KLK10:Uesl; KUU1:TUSP; 
KLK12.KLKU; KLK13:KLK1A; KLK14:KLKL6. Since this nomencla- 
ture is very recent, old and new gene designations will be used concur- 
rently to avoid confusion. Details of the new nomenclature can be found 
on the Human Genome Nomenclature Committee Website and in Dia- 
mandis et al. (50). 

2 The abbreviations used are: Nesl, normal epithelial cell-specific 1; 
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pressed in brain and breast tissue, and have been suggested to 
be involved in Alzheimer's disease and breast cancer, respec- 
tively (12-14). Several other recently identified KLK-Uke ser- 
ine protease genes have also been shown, at the RT-PCR level, 
to be expressed in a tissue-specific manner, which presumably 
indicates the tissues in which they will be biologically active 
(15-20). 

The KLK1, KLK2, and KLK3 genes are clustered within a 
region that spans 60 kb on chromosome 19ql3. 3-13.4 (21, 22). 
This cluster maps to a position -55.5 Mb from the p-telomere 
on the metric physical map of the chromosome and at the 
proximal end of 19ql3.4. The larger size of the rat and mouse 
KLK gene families (13-24 genes) (23-25), which are also clus- 
tered in one locus (23, 25, 26), and the recent identification of a 
new KLK gene, KLK4 (prostase/KLKLl) (15, 21, 27) 25 kb 
downstream of the KLK2 gene (15, 21), suggested that the 
human KLK gene family would be larger than previously 
thought. Indeed, Southern blot analysis of a human genomic 
library, using a monkey KLK cDNA with 92-95% homology to 
human KLK cDNAs (28), indicated that there may be as many 
as 19 human KLK genes (29). Furthermore, a number oiKLK- 
related serine protease gene sequences have been detected that 
lie telomeric to the original KLK locus, thus expanding the 
human KLK gene family (30). However, the physical mapping 
of this locus has not yet been described and the definitive size 
of the family is not clear. In addition, the location of the various 
micrbsatellite markers in the KLK locus is not clear. 

In this study, we have physically mapped the serine protease 
genes clustered at the KLK locus and defined the position of 
key microsatellite markers. We have also confirmed the order 
and orientation of the 10 putative new members of the human 
KLK family which span approximately 300 kb across the KLK 
locus on chromosome 19ql3.4. We have further determined 
that no other serine protease genes are present within the 
regions surrounding the KLK locus. In addition, extensive gene 
expression studies have identified the most abundant sites of 
expression for these genes and a diverse tissue specificity of 
expression similar to the rodent KLK family. Also, phylogenetic 
analysis of the 14 serine proteases encoded by the KLK genes of 
this locus demonstrated the extent of divergence within the 
expanded family and suggested that there are at least four 
human KLK subfamilies. 

EXPERIMENTAL PROCEDURES 

Cosmid and BAC DNA Clones— Four cosmids (R32740, R31381, 
F22702, and R28781) and six BAC clones (BC772576, BC778306, 
BC33747, BC85745, BC349237, and BC892989), that spanned the clas- 
sical KLK locus (KLK1-3) and approximately 400 kb either side of this 
region, were used in the following studies. Five hundred-ml cultures of 
Luria-Bertani medium supplemented with 20 /ig/ml kanamycin or 15 
/Ag/ml chloramphenicol, were inoculated with the BAC and cosmid 
clones, respectively, and incubated overnight at 37 °C. DNA was then 
extracted using the alkaline lysis method, followed by RNase A diges- 
tion and ethanol precipitation. 

Physical Mapping of the KLK Locus on Chromosome 19q— Twenty jig 
of DNA from the set of 10 BAC and cosmid clones were digested 
overnight at 37 °C with the restriction enzyme JScoRI, then separated 
electrophoretically on a 1% agarose gel. The gel was treated with 1.5 M 
NaCl, 0.5 M NaOH for 30 min to denature the DNA, rinsed briefly in 
distilled water, followed by a 30-min incubation in neutralization buffer 
(1.5 M NaCl, 0.5 m Tris-HCl, pH 7.4). The DNA was capillary blotted 
onto Hybond N™ nylon membrane (Amersham Pharmacia Biotech) 
, overnight in 20 X SSC then cross-linked to the membrane by ultraviolet 
irradiation. Membranes were hybridized with a 32 P-random labeled 
cDNA probe (Ambion), or a 32 P-end labeled gene-specific oligonucleotide 
probe (Table I), or a 32 P-end labeled degenerate oligonucleotide probe to 



RT-PCR, reverse transcriptase-polymerase chain reaction; kb, kilo- 
base^); PSA, prostate-specific antigen. 



Tabus I 
Southern blot probes 
Southern blot analysis of chromosome 19q cosmid/BAC EcoBI-di- 
gested DNA was performed using 82 P-end labeled oligonucleotide 
probes for KLK5-8 and KLK10-12, and 32 P random-labeled cDNA 
probes for KLK9 and KLK13-14. Southern blot analysis of RT-PCR 
products was performed using a digoxigenin-labeled oh'gonucleofcide 
probe. The percent difference in sequence between an individual probe 
and each of the 14 genes for all probes used averaged 70%. 



Gene 



DNA/oligonudeotide probes 



KLK1 IKLK1 

KLK2/KLK2 

KLK3/KLK3 

KLK4/KLK4 

KLK5/KLKL2 

KLK6/protease M 

KLK7/SCCE 
/CkKS/Neuropsin 

KLK9/KLKL8 

KLKIO/Nesl 

KLKlimJSP 

KLK12/KLKL5 

KLK13/KUQA 



KLK14/KLKL6 



5^CTGGACGTGGOOTTTTTCGCAC-3' 
5 '-TCTAAGGCTTTGATGCTTCAG-3 ' 
5 '-GGTATTTCAGGTC AGCCACAG-3' 
5^CTACCGTGCTGCAGTGCGTG-3' 
5 ' - AGTGC ACTTGGGGGCTCTTGGTT-3 ' 
5'-TGTCCTGGTCATGGCTGGC-3' (RT-PCR) 
5 '-AATCACC ATCTGCTGTCTTGC-3 ' 
5 '-GCCCAGGTGC ACGGTGTACTC-3 ' 
5 '-C AATGATCTGCCAGGCTGA-3 ' (RT-PCR) 
5'-GAGCCCCAGGATGTGATGCCC-3' 
RT-PCR product (678 bp) 
5'-CCCCCAGCCTGAGATGAGACACT-3' 
5^GCCGTGGTGCCCCAGCCAGCA-3' (RT-PCR) 
5 '-GGTAAAC ACCCCACGAGAGGA-3' 
5 '-GGTGATGGAGACTGGCGA-3 ' 
5^AGGTCGT<X?rCGTGGCTCGTC-3' 
EST clone AA002101 EcoWHindlU digest 
(1500 bp) 

5 '-AGGGGTTAGGCGGTTGTTGTGG-3' 

EST clone AI935298 EcoWHindm digest (750 bp) 

5 '-TGTTCACGC ATTGC AGAGAGG-3 ' 



the highly conserved regions spanning either the catalytic histidine 
codon (GGTSCTS AC AGCY GCCC AYTG; 16-fold degeneracy; S = G or 
C, Y = C or T) or the the catalytic serine codon (CAGIGGICCIC- 
CIGARTCAAC; 512-fold degeneracy; I = inosine, r = A or G) charac- 
teristic of serine proteases. Hybridization was performed overnight in 
ExpressHyb™ solution (CLONTECH) at 42 or 55 °C for oligonucleotide 
and cDNA probes, respectively. Blots were washed in 1 X SSC, 0.5% 
SDS up to 55 °C, then exposed to x-ray film (AGFA Curix Ortho) at 
-80 °C. 

Computer Analysis of Chromosome 19 Sequences — Approximately 
320 kb of draft sequence of chromosome 19ql3.3-13.4 (available at the 
Lawrence Livermore National Laboratory and the DOE Joint Genome 
Institute website), beginning -50 kb upstream of where two of the 
original KLK genes (JSLK2-3) were localized and searched with the 
conserved peptide sequences spanning the catalytic histidine, aspar- 
tate, and serine residues of KLK serine proteases (WVLTAAHC, 
HDLMLLKL, and GDSGGPL; catalytic residue underlined). This was 
performed using the data base creating tool and the tBLASTN algo- 
rithm available at the Australian National Genome Information Ser- 
vice. In addition, the cDNA sequence of each of the serine proteases, 
KLK1 (accession number X13561), KLK2 (accession number S39329), 
KLK3 (accession number M26663), KLK4 (accession number- 
API 13 141), KLK5 1 (KLKL2, accession number AF135028), KLK6 (Pro- 
tease M, accession number U62801), KLK7 (SCCE, accession number 
L33404), KLK8 (neuropsin, accession number AB009849), KLK9 
(KLKL3, accession number AF135026), KLK10 (Nesl, accession num- 
ber AF024605), KLKll (TLSP, accession number AB012917), KLK12 
(KLKL5, accession number AF135025), KLK13 (KLKL4,accession num- 
ber AF135024), and KLK14 (KLKL6, accession number AF161221), was 
aligned against this genomic sequence using the BLASTN algorithm 
available at the National Center for Biotechnology Information website. 
BLASTN searches were also performed against the draft chromosome 
19ql3.3-13.4 sequence using microsatellite marker sequences between 
272.77 and 276.46CR3000 at interval D19S425-D19S418; Marker se- 
quences used were: WI-9055 and M21896, U37672, stSG39975, sts- 
S39329, WI-20869, stSG30247,- stSG32045, stSGl457, stSG3375, 
stSG372, sts-H71236, SGC35527, U37672, stSG30429, sts-AA028917, 
and L12214. The computer software DNAStar (Lasergene) was used to 
construct an EcoBX restriction enzyme map of this sequence. These data 
were then used in conjunction with the data obtained by physical 
mapping and the EcoBl map available at the Lawrence Livermore 
National Laboratory website to determine the exact position of each 
gene and to reconstruct a definitive EcoBl map of this region. 

Sequence and Phylogenetic Analysis—Multiple alignment of deduced 
protein sequences was performed using the Eclustal tool available at 
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Table II 
PCR primers 

D^r? 61 " PairS USed f ° r the detection of 14 KLK S enes bv RT-PCR and for the generation of DNA fragments for KLK6-11, used in poly(A) + 
RNA dot blot analyses. The percent difference in sequence between an individual primer and each of the 14 genes for all primers used averaged 
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PCR primers . 


Product size 


KLK1 IKLK1 


Forward 5 '-TGGAGAACC ACACCCGCCAAG-3 ' 


468 bp 


KLK2IKLK2 


Reverse 5 '-ACGGCGACAGAAGGCTTATTG-3 ' 


Forward 5 ' -GCCT AAAGAAGAATAGCC AGGT-3 ' 


375 bp 


KLK3IKLK3 


Reverse 5'-CTCAGACTAAGCTCTAGCACAC-3' 


Forward 5'-GCATCAGGAACAAAAGCGTGA-3' 


138 bp 


KLK4IKLK4 


Reverse 5'-CCTGAGGAATCGATTCTTCAG-3' 


Forward 5'-GCGGCACTGGTCATGGAAAAGG-3' 


526 bp 


KLK5/KLKL2 


Reverse 5'-CAAGGCCCTGCAAGTACCCG*3' 


Forward 5 ' -GAGCTGGGGCCGGGGAAGAC-3 ' 


657 bp 


KLK6/Prote3Lse M 


Reverse 5 '-TGGGCCGGGC AC AAGGGTAA-3 ' 


Forward 5 ' -GAGCGGOC ATGAAGAAGC-3' 


459 bp 


KLK7/SCCE 


Reverse S'-AATCACCATCTGCTGTCTTGT-T 


Forward 5'-GCCCAGGGTGACAAGATTATT-3' 


569 bp 


jnJLtj\.of iN europsin 


Reverse 5'-GTACCTCTGCACACCAACGG-3' 


Forward 5'-TACTCTGTGGCGGTGTCCTTG-3' 


523 bp 


KLK9/KLKL3 


Reverse 5 ' -GAGCCCC AGGATGTGATGCCC-3 ' 


Forward 5'-GGCCGGCCTCTTCCACCTTAC-3' 


678 bp 


KLKIO/Nesl 


Reverse 5'-GCGCGGGCTCAGTTCTCCAT-3 ' 


Forward 5 ' -GCGGAAAC AAGCCACTGTGGG-3 ' 


486 bp 


KLK11/TLSP 


Reverse 5'-GGTAAACACCCCACGAGAGGA-3' 


Forward 5 '-CCGCTAC ATAGTTC ACCTGG~3 ' 


284 bp 


KLK12/KLKL5 


Reverse 5'-AGGTGTGAGGCAGGCGTAACT-3' 


Forward 5'-TGGCAGACAAAGAGACAAGGT-3' 


944/815 bp 


KLK13/KLKL4 


Reverse 5'-CTTAGAAGGGCTGGCAGGAG-3' 


Forward 5'-CTACACCTGCTTCCCCCACTCTCA-3' 


617 bp 


KLK14/KLKL6 


Reverse 5 '-GCCGGTCAGGTTGCCCAC AT-3 ' 


Forward 5 '-CTGGGCAAGCACAACCTGAG-3 ' 


517 bp 




Reverse 5'-GCATCGTTTCCTCAATCCAGC-3' 



J 

Australian National Genome Information Service. The percentage iden- 
tity between each of the 14 kallikreins was calculated using the Homol- 
ogies tool on the Australian National Genome Information Service 
website. Phylogenetic studies were also performed via the Australian 
National Genome Information Service website. A distance matrix be- 
tween the structures of the proteases was computed from the alignment 
using the Dayhoff program of the Phylip package (31). The Neighbor- 
joining method was then used to construct a tree from the distance 
matrix. 

Poly(A) + RNA Dot Blot Analysis— For KLKl-4, 32 P-end labeled oli- 
gonucleotides were used to probe a CLONTECH Master blot containing 
poly(A) + RNA from 50 different human tissues. Hybridizations were 
performed overnight in ExpressHyb™ solution (CLONTECH) at 42 *C. 
For KLK6-11 DNA fragments used in the poly(A) + RNA dot blot anal- 
yses were obtained by RT-PCR using the primers detailed in Table II. 
RT-PCR was performed as described below for the screening of the 
cancer cell lines. Amphcons were purified using a PCR purification kit 
(Qiagen) and ligated into pGEM-T or pGEM-T Easy (Promega). DNA 
insert orientation was examined by sequencing and restriction enzyme 
digestion analysis. For KLK5 and KLK12-14, probes were generated 
from EST clones accession number W07551, accession numbers 
AI394679, AA002101, and AI935298 (Genome Systems Inc), respec- 
tively. Prior to radioactive labeling each construct was linearized before 
transcription using T7, T3, or SP6 RNA polymerase as appropriate to 
generate an antisense cRNA. Radiolabeled cRNA probes were gener- 
ated from these clones by incorporation of [ 32 PJUTP (Geneworks) using 
a StripEz™ kit (Ambion) and hybridized, in Ultrahyb™ solution (Am- 
bion), to a Multiple Tissue Expression array (CLONTECH) containing 
poly(A) + RNA from 76 different human tissues and cell lines at 65 °C 
overnight. All blots were washed in 0.1 x SSC, 0.1% SDS up to 70 °C, 
and signals were detected by exposure of the blot to x-ray film (AGFA 
Curix Ortho) for 1-7 days at -80 °C. Control blots, containing DNA for 
each of the 14 kallikreins, were included in each hybridization to 
confirm the specificity of the radiolabeled probes. 

Cell Lines and Tissue Samples — The following cell lines were ob- 
tained from the American Tissue Culture Collection: T47D (breast 
cancer cell line); HaCAT (keratinocyte cell line); OVCAR-3 (ovarian 
cancer cell line); HEC1A, Ishikawa, and KLE (endometrial cancer cell 
lines); and DU145, PC3, and LNCaP (prostate cancer cell lines). An 
additional prostate cell line, ALVA41, was obtained from Dr. A. Nakla, 
St Luke's Hospital, New York. The normal kidney tissue samples and 
the renal carcinoma cell lines, Cakil and SN12kl, were a gift from Dr. 
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BC772576 
BC778306 
R32740 
R31381. 
BC33747 
, F22702 
R28781 
BC85745 
BC349237 
BC892989 



Fig. 1. Identification of JKZJT-like genes by Southern blot anal- 
ysis. £coRI-digested BAC and cosmid DNA was hybridized with a 
32 P-end labeled degenerate oligonucleotide to the highly conserved re- 
gion spanning the catalytic histidine of serine proteases. The blot was 
washed in 0.1 x SSC, 0.1% SDS at 55° then exposed to film for 13 days. 

David Nicol, Princess Alexandra Hospital, Brisbane, Australia. The 
prostate tissue samples were obtained during surgery by Dr. Robert 
Gardiner, Royal Brisbane Hospital, Brisbane. The ovarian cancer cell 
line, PEOl, was obtained from Dr. Michael McGuckin, Mater Medical 
Research Institute, Brisbane. Salivary gland tissue total RNA was 
purchased from CLONTECH and the human salivary gland cell line 
was obtained from Dr. Ray MacDonald, University of Texas, South- 
western Medical Center, Dallas, TX. All cell lines were cultured in 
Dulbecco's modified Eagle's medium with 10% deactivated fetal calf 
serum and supplemented with 50 units/ml penicillin G and 50 /xg/ml 
streptomycin (CSL Biosciences) and incubated at 37 °C with 5% CO a . 

RT-PCR and Southern Blot Analysis— Total RNA was extracted from 
the cell lines using Trizol reagent (life Technologies, Inc.) following the 
manufacturer's instructions. Two pug of total RNA was reverse-tran- 
scribed into first-strand cDNA using Superscript II (Life Technologies^ 
Inc.) in a 20-jU reaction. The cDNA was subsequently diluted 5-fold and 
PCR was then performed in a reaction mixture containing i pi of cDNA, 
25 ng of each primer (Table ID, 0.1 mM dNTPs (containing dATP, dCTP, 
dGTP, and TTP), 2 of 10 X reaction buffer (Roche Molecular Bio- 
chemicals), 0.5 units of platinum Taq (Life Technologies), and sterile 
water to 20 pi. The cycling conditions were 94 °C for 5 min to activate 
the polymerase, followed by 35 cycles of 94 °C for 1 min, 62-68 °C for 1 
min (depending on the target gene), 72 °C for 1 min, and a final exten- 
sion at 72 °C for 7 min. Ten pi of the PCR mixture was electrophoresed 
on a 2% agarose gel and visualized by ethidium bromide staining. The 
resulting amplicons were analyzed by Southern blot hybridization us- 
ing digoxigenin 3' end-labeled oligonucleotide probes (Table I), internal 
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Fig. 2, Fine mapping of KLK5-14 to 
BAC and cosmid clones spanning the 
KLK locus at 19ql3.4. .EcoRI-digested 
BAC and cosmid DNA was hybridized 
with S2 P-end labeled oligonucleotides or 
S2 P-random labeled cDNAs as described 
in the text and detailed in Table L 
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Fig. 3. Location of the KLK locus at chromosome 19ql3.3-13.4. A, schematic representation of the interval between D19S425 and D19S418 
including 19ql3.3-13.4. The KLK locus is located proximal to DI9S418. B, BAC and cosmid clones spanning the ia/Hocus. The position of the 14 
kalhkrein encoding genes are marked. C, in silico generated EcoRI digest data for BC85475 and BC349237 containing KLK4 to KLK14. Fragments 
marked with an asterisk are either incomplete end fragments or have not been clearly determined because of the provisional status of the sequence, 
TJe size of fragments obtained by Southern blot analysis data (see text and Fig. 2) is consistent with the in silico data. The position and orientation 
of KLKX to KLK4 on cosmids F24242, F25479, F22702, and R28781 have been reported previously (21). KLK4 to KLK14 are transcribed telomere 
to centromere as indicated by the arrow at the bottom of the figure. The position of microsatellite markers are indicated in B and C. 



to the PCR primers used, in EasyHyb™ solution (Roche Molecular 
Biochemicals) overnight at 37 °C. Washes were performed in 0.2 X SSC, 
0.1% SDS at 37 °C. In addition, for each kallikrein cDNA primer pair at 
least one band of the predicted size was excised from a second 2% 
agarose gel following electrophoresis. The DNA recovered from each 
band was sequenced to determine the specificity of the PCR products. 

Gene-specific Oligonucleotides— Gene-specific olionucleotides, used 
as probes in the genomic and/or RT-PCR Southern blot analyses (Table 
I) were selected from regions with maximum dyshomology among the 
kallikreins based on a multiple sequence alignment of the 14 cDNAs. 



The primers used in the PCR reactions (Table ID were designed to span 
at least one intron for each of the 14 genes. The primer pairs were 
chosen for their compatibility (i.e. no hairpin stem loops and similar 
melting temperatures) and were found to be specific for the gene in 
question upon BLAST analysis. In addition, the percent difference for 
each primer/probe as compared with those of all the 14 genes was 
calculated and found to range from 50 to 95%. The average percent 
divergence for the oligonucleotides used as probes in the Southern blot 
experiments was 70% whereas the average for the primers used in the 
PCR was 74%, 
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Table III 

Percentage homology betiveen each member of the human kallikrein family at the protein level 



hKl 

hK2 

hK3 

hK4 

hK5 

hK6 

hK7 

hK8 

hK9 

hK10 

hKll 

hK12 

hK13 

hK14 



hKl 




nt\o 


niv4 


LITE 


hK6 


bK7 


hK8 


hK9 


hKlO 


hKll 


hKl2 


hKl3 


hKl4 


100 


67 


62 


34 


31 


37 


37 


36 


35 


31 


36 


27 


37 
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Fig. 4. Phylogenetic tree of the ex- 
panded KLK family. The tree was gen- 
erated from an Eclustal multiple se- 
quence alignment of the 14 kallikrein 
proteins using the Dayhoff program of the 
Phylip package. 
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RESULTS 

Characterization of an Extended KLK Locus at Chromosome 
19ql3.4 — To determine if there were additional kallikrein 
genes surrounding the original KLK locus on chromosome 
19ql3.3-13.4, Southern blot analysis was performed using de- 
generate oligonucleotides to consensus sequences spanning the 
catalytic histidine and serine residues characteristic of serine 
proteases. A combination of 10 BACs and cosmids, which span 
400-500 kb on either side of the original KLK60-kb locus, were 
digested with J£coRI and subsequently hybridized separately 
with each of the 32 P-labeled degenerate oligonucleotides. The 
autoradiograph resulting from hybridization with the degener- 
ate "histidine" probe demonstrated positive signals within five 
of the BACs and cosmids (Fig. 1). Three of these clones 
(BC33747, F22702, and R28781) contain the genes, KLK1-4 
(21). However, a further two BACs (BC85745 and BC349237) 
also showed strong hybridization to these probes. These were 
not known to contain any serine protease genes. BC892989, 
which lies telomeric to the KLK cluster and the 4 clones cen- 
tromeric to the cluster (R31381, R32740, BC778306, and 
BC 772576) did not produce any positive signals even though 
the membrane was exposed for 13 days. The degenerate "ser- 
ine" oligonucleotide probe could not be used successfully in 
these experiments, probably because of its degeneracy (512- 
fold). Thus, additional serine protease genes are only found 



within a region encompassed by BACs BC85745 and 
BC349237, and there are no other serine protease genes ap- 
proximately 200 kb telomeric of BC349237 and 400 kb centro- 
meric of the original KLK1S cluster. 

In order to identify the serine protease genes which were 
detected using the degenerate probe described above, the same 
set of ZfcoRI-digested BACs and cosmids were examined using 
one or two gene-specific oligonucleotides or a cDNA probe to 
each of the KLK5-14 genes. The KLK5 probe hybridized to a 
14.5-kb fragment on BC85745 (Fig. 2). Positive signals were 
detected on BC85745 at 10 and 6.5 kb for KLK6 and at 15 kb for 
KLK7. A signal was observed for KLK8 on both BC85745 and 
BC349237 at 10.5 and 2 kb, respectively, and KLK9 was found 
to be located within three of the EcoEL fragments at 10, 3.5, and 
1.8 kb on BC349237. Two bands at 18 and 10 kb were detected 
for KLK10 on BC349237 whereas only the 18-kb band was 
positive for KLK11. The remaining three iCLiiL-like genes were 
all present on BC349237. KLK12 was present on an 18-kb 
band. Although only one signal (at approximately 5 kb) was 
detected for KLK13, this band represents two individual EcoRI 
fragments of 5.7 and 4.6 kb in size. The KLK14 cDNA probe, 
hybridized to an 18- and a 12-kb band. 

Concurrent with the physical mapping studies, computer 
analysis was performed on approximately 320 kb of draft 
genomic sequence spanning the KLK locus on chromosome 19q. 
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Fig. 5. mRNA expression patterns. Membranes containing poly(A) + from human tissues and cell lines were screened with S2 P-labeled probes 
to each of the 14 KLK genes. Autoradiographs representative of the patterns obtained are shown. A KLK1. B, KLK2-4. C t KLK10. D, KLK11, 
Tissues arrayed on the membranes in panels A and B: Al, whole brain; A2, amygdala; A3, caudate nucleus; A4, cerebellum; A5, cerebral cortex* 
A6, frontal lobe; A7, hippocampus; A8, medulla oblongata; Bl, occipital lobe; B2, putamen; B3, substantia nigra; B4, temporal lobe; B5, thalamus; 
B6, nucleus accumbeus; B7, spinal cord; CI, heart; C2, aorta; C3, skeletal muscle; C4, skeletal muscle; C5, bladder; C6, uterus; C7, prostate; 08^ 
stomach; Dl, testis; D2, ovary; D3, pancreas; D4 pituitary gland; D5 adrenal gland; D6, thyroid gland; D7, salivary gland; D8, mammary gland- 
El, kidney; E2, liver; E3, small intestine; E4, spleen; E5 ( thymus; E6, peripheral blood leukocyte; E7, lymph node; E8, bone marrow; Fl, appendix; 
F2, lung; F3, trachea; F4, placenta; Gl, fetal brain; G2, fetal heart; G3, fetal kidney; G4, fetal liver; G5, fetal spleen; G6, fetal thymus; G7 fetal 
lung; HI, yeast total RNA, 100 ng; H2, yeast tRNA, 100 ng; H3, Escherichia coli rRNA, 100 ng; H4, E. coli DNA, 100 ng; H5 t Poly r(A) 100 ng; H6, 
human C 0 t-1 DNA, 100 ng; H7, human DNA, 100 ng; H8, human DNA, 500 ng. Tissues arrayed on the membranes in panels C and D: Al, whole 
brain; A2, cerebellum, left; A3, substantia nigra; A4, heart; A5, esophagus; A6, colon; A7, kidney; A8, lung; A9, liver; A10, leukemia, HL-60; All 
fetal brain; A12, yeast total RNA; Bl, cerebral cortex; B2, cerebellum, right; B3, accumbens nucleus; B4, aorta; B5, stomach; B6, colon, descending*' 
B7, skeletal muscle; B8, placenta; B9, pancreas; B10, HeLa S3; Bll, fetal heart; B12, yeast tRNA; CI, frontal lobe; C2, corpus caUosum; C3* 
thalamus; C4, atrium, left; C5, duodenum; C6, rectum; C7, spleen; C8, bladder; C9, adrenal gland; C10, leukemia, K-562; Cll, fetal kidney; C12,' 
E. coli rRNA; Dl, parietal lobe; D2, amygdala; D3, pituitary gland; D4 atrium, right; D5 jejunum; D7, thymus; D8, uterus; D9, thyroid gland* D10* 
leukemia MOLT-4; Dll, fetal liver; D12, E. coli DNA; El, occipital lobe; E2, caudate nucleus; E3, spinal cord; E4, ventrical, left; E5, ileum; E7 
peripheral blood leukocyte; E8, prostate; E9, salivary gland; E10, Burkitt's lymphoma, Raji; Ell, fetal spleen; E12, Poly r(A); Fl, temporal lobe* 
F2, hippocampus; F4, ventrical, right; F5, ilocecum; F7, lymph node; F8, testis; F9, mammary gland; F10, Burkitt's lymphoma, Daudi; Fll, fetal 
thymus; F12, human C 0 t-1 DNA; Gl, cerebral cortex; G2, medulla oblongata; G4 interventricular septum; G5, appendix; G7, bone marrow* G8 
ovary; G10, colorectal adenocarcinoma, SW480; Gil, fetal lung; G12, human DNA 100 ng; HI, pons; H2, putamen; H4, apex of heart* H5 colon 
ascending; H7, trachea; H10, lung carcinoma, A549; H12, human DNA 500 ng. ' 



To identify novel protease genes in this region, searches of the 
genomic sequence were performed using the consensus se- 
quences spanning each of the histidine, aspartate, and serine 
residues necessary for the enzymatic activity of serine pro- 
teases. These searches identified 10 potential serine protease 
genes in addition to known kallikrein genes, KLK1-4. Further 
searches were then performed with the sequence for each of the 
KLK-like serine protease genes: Protease M (KLK6\ SCCE 
(KLK7), neuropsin (KLK8\ Nesl (KLK10\ TLSP (KLK11), 
KLKL2 (KLK5), KLKL3 (KLK9), KLKL4 (KLK13\ KLKL5 
(KLK12), and KLKL6 (KLK14\ using the BLASTN algorithm. 
All 10 of these genes were identified within this sequence and 
were found to correspond to the genes identified from searches 
using the three serine protease consensus sequences. The 
genes were aU located in a telomeric position with respect to 
the KLK2-4 genes. The KLK1 gene is located -31 kb centro- 
meric of KLK2-4 (21, 22); sequence data is not available for 
this region. Also, an EcoRl restriction map of the sequence 
contained within BACs BC85745 and BC349237 was con- 
structed in silico. This map was used in conjunction with the 



physical mapping data and the EcoBI map available at the 
Lawrence Livermore National Laboratory website, to deter- 
mine the exact position of each gene and to reconstruct a 
definitive EcoRI map of this region (Fig. 3). The size of each 
of the bands identified by Southern blot analysis (Fig. 2) 
agreed with the in silico generated -EcdRI fragment sizes. From 
the map, the order of the expanded KLK locus (according to the 
old nomenclature) is centromere-KLKl-KLK3-KLK2-KLK4- 
KLKL2-Protease M-SCCE -neuropsin- KLKL3-Nes 1-TLSP- 
KLKL5-KLKL4-KLKL6-telomere (new nomenclature: centro- 
mere-KLKl~KLK3-KLK2-KLK4- to 4 -telomere). Analysis 

of the contiguous sequence indicated that KLK5 to KLK14 are 
transcribed telomere to centromere (Fig. 3). 

Clarification of the Position of Microsatellite Markers within 
the KLK Locus at Proximal 19q 13. 4— To clarify the order and 
position of the microsatellite markers within the KLK locus, 
microsatellite marker sequences were aligned against the chro- 
mosome 19q proximal 13.4 draft sequence data using the 
BLASTN algorithm. For this purpose, a data base of contiguous 
sequence was established using the resources available at the 
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Table IV 

Summary of mRNA expression patterns 
Membranes containing po!y(A)" from human tissues and cell lines were screened with S2 P random labeled probes to each of the 14 kallikreins. 



Gene 



High expression 



Moderate expression 



Gene 



High expression 



Moderate expression 



KLK1 


Kidney 




KLK8 


Pancreas 


Duodenum 




Pancreas 




( NptiiwnQin^ 




Liver 




Salivary gland 






Esophagus 
Stomnph 


KLK2 


Prostate 




KLK9 


Pancreas 


Salivary gland 


KLK3 






(KLKL3) 






Prostate 




KLKIO 


Pancreas 


Duodenum 

i r<iciic<i 


KLK4 


Prostate 








Colorectal adenocarcinoma 




(TLSP) 


Pancreas 


Heart 

Salivary gland 

Stomach 

Liver 


KLK5 


Testis 








Skeletal muscle 


Salivary gland 


KLK12 


Pancreas 


Brain 


(KLKL2) 


Mammary gland 


Esophagus 


(KLKL5) 




Duodenum 


KLK6 


Pancreas 


Spinal cord 


KLK13 


Pancreas 


Appendix 
Duodenum 


(Protease M) 




Brain 


(KLKL4) 


Esophagus 


Stomach 






Testis 




Appendix 


Brain 






Appendix 




Testis 






Colorectal adenocarcinoma 






Prostate 


KLK7 


Pancreas 




KLK14 


Brain 


Salivary gland 
Liver 


(SCCE) 






(KLKL6) 


Bone marrow 
Fetal liver 


Pancreas 
Fetal spleen 
Prostate 



Australian National Genome Information Service website. 
Marker sequences between 272.77 and 276.46cR3 000 at interval 
D19S425-D19S418 on chromosome 19 were aligned with the 
nucleotide sequence within the data base. The data obtained 
from these searches was used to re-evaluate the order and 
presence of these markers within the region of the KLK locus 
(Fig. 3). Markers WI-9055 and M2 1896, previously recognized 
as partial KLK3 sequence, as well as U37672 (previously un- 
known) were all found to lie within the KLK3 sequence on 
cosrnid F25479. The two markers previously identified as par- 
tial KLK1 sequence, stSG39975 and sts-S39329, are actually 
partial sequences of the KLK2 gene and were found within the 
sequence of R33359 which overlaps cosrnid F22702 at this 
location. The marker, WI.20869, recognized as KLKG partial 
sequence, was confirmed to lie within the KLK6 coding se- 
quence on BC85745. Similarly, stSG30247 was shown to He 
within the KLKIO sequence on BC349237. In contrast, none of 
the remaining microsatellite markers listed as being located in 
the interval D19S425-D19S418 were detected within the avail- 
able genomic sequence of the extended KLK locus. These mark- 
ers are stSG32045 (EST), stSG1457 (/3-Electron transfer fla- 
voprotein), stSG3375 (OTK18 mRNA), stSG372 (CD33 
antigen), sts-H71236 (EST), SGC35527 (granulocyte colony 
stimulating factor-induced gene), U37672 (unknown), 
stSG30429 (EST), sts-AA028917 (EST, highly similar to zinc 
finger protein 91), L12214 (unknown). Prior mapping work 
shows genes CD33 and ETFB to be distal to the KLK locus by 
—40 and —250 kb, respectively. 

Divergence of Serine Protease Genes within the Extended 
KLK Locus — Protein alignments were performed to establish 
the degree of similarity and divergence of the 14 serine prote- 
ase genes located within the expanded KLK locus. The percent- 
age homology between the proteins encoded by the 14 KLK 
genes was also calculated (Table III). The classical KLK family 
members, hKl, 3 hK2 and PSA (encoded by KLK1, KLK2, and 



KLK3 P respectively), display a high degree of homology (62- 
77% identity) at the protein level. hK4, is less similar with only 
34% identity to PSA. As shown in Table HI, hK4 possesses a 
similar degree of homology to the more recently identified 
serine proteases hK5-14 (25-45%) as it does to PSA. Similarly, 
the degree of homology between the protease groups, hK5-14 
and hKl-3, ranges from 26 to 41%. Phylogenetic analysis of the 
14 protein sequences produced similar results to the homology 
studies (Fig. 4). The phylogenetic tree demonstrated that IiKl, 
hK2, and PSA lie on one branch, as do hK4, hK5, and hK7. The 
two serine proteases, hK10 and hK12, were clustered on a third 
branch while hK9 and hKll were found on a fourth branch. 
However, the remaining four members (hK6, hK8, hK13, and 
hK14) were located around the base of the tree. Due to the low 
branch numbers assigned to these four kallikreins, it is difficult 
to precisely determine their degree of divergence from the 
family as a whole. Nevertheless, the high branch numbers 
observed overall suggest that this expanded family of human 
kallikreins consist of at least four subfamilies. 

mRNA Expression Patterns — To establish a comprehensive 
pattern of expression for the 11 KLK genes, KLK4-14,. as 
compared with that known for KLK1-3, arrays containing 
poly(A) + RNA from either 50 or 76 different human tissues and 
cell fines were screened using 32 P-labeled probes to each of the 
14 KLK serine protease genes. Examples of the autoradio- 
graphy obtained are shown in Fig. 5 and a summary of the 
expression levels is shown in Table IV. The expression patterns 
observed for KLK1-4 were as previously reported, where KLK1 
was detected in the kidney, pancreas, and salivary gland while 
KLK2-4 were present only within the prostate (27, 32, 33). 
High expression of the KLKIO gene was observed in the pan- 
creas and moderate expression in the esophagus, duodenum, 
trachea, and colorectal adenocarcinoma. Low expression was 
seen in tissues such as the fiver, kidney, ovary, testis, prostate, 
and salivary gland. High expression of KLKll was also de- 



3 Kallikrein family protein nomenclature used here is that described by Berg et al. (3), e.g. protein hKl corresponds to gene KLKl, 
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Fig. 6. Expression pattern analysis by RT-PCR. Analysis was 
performed on total RNA isolated from the displayed cell lines and 
tissues using the primers listed in Table II. RT-PCR products were 
transferred to nylon membranes and probed by Southern blot analysis 
using digoxigenin-labeled oligonucleotide probes (Table I) internal to 
the PCR primers used. SG, salivary gland. 



tected in the pancreas. Moderate expression of KLK11 was 
observed in the heart, salivary gland, stomach, liver, and skel- 
etal muscle, while low expression was observed in most of the 
remaining tissues represented on the blot. Of interest, is the 
overall high expression of nearly all of these new KLK genes in 
the pancreas, particularly KLK6 to KLK13. Furthermore, mod- 
erate expression was observed for several genes in the salivary 
gland, esophagus, trachea, brain, gastrointestinal, and repro- 
ductive organs. 

In addition, RT-PCR was performed using total RNA ex- 
tracted from a number of different transformed and cancer cell 
lines as well as normal and cancer tissues. To increase the 
sensitivity of detection, Southern blot analysis using gene- 
specific oligonucleotide probes was performed on the PGR prod- 
ucts as shown in Fig. 6 and summarized in Table V. To confirm 
the specificity of the PCR for each of the 14 genes, bands of the 
predicted size were sequenced and found to correspond to the 
gene of interest (see Table V). From these results, it can be seen 
that each gene displays a distinctive expression pattern with 
several genes (KLK1, KLK6, KLK7, KLK9, KLK10, KLK11, 
KLK13, and KLK14) being expressed in a broad range of tis- 
sues/cell lines whereas other genes (KLK2-5, KLK8, and 
KLK12) are less widely expressed. Of interest is the high ex- 
pression of many of the newer KLK genes (KLK5-14) in kera- 



. tinocyte (HaCAT), ovarian (OVCAR-3 and PEOl), breast 
(T47D), and endometrial (KLE) cancer cell lines. In addition, 
all genes (except for KLK12) appear to have varying levels of 
expression in one of the four prostate cancer cell lines. Further- 
more, several of these genes are moderately expressed in the 
salivary gland and kidney. Additional bands were observed for 
several of the genes being examined. Some of these alternate 
bands were sequenced and found to represent variants of the 
gene in question (data not shown). Similarly, other studies 
have identified alternatively spliced transcripts for these new 
KLK genes (19). 

DISCUSSION 

The tissue kallikrein genes, particularly those in rodents, 
represent a distinctive family of a large number of genes, which 
are highly conserved in sequence and genomic organization. 
Nevertheless, the encoded enzymes exhibit specific tissue ex- 
pression patterns and diverse biological functions. We have 
now determined that the size of the human KLK gene family, 
like its rodent counterparts, is large and it is likely to consist of 
14 genes. Physical mapping of these genes at chromosome 
19ql3.4 concurs with recent computer analysis of draft se- 
quence data of this region (30). The tissue-specific patterns of 
expression observed for these 14 genes shows, again like ro- 
dents, a diverse expression profile for individual members of 
the family. Of interest, however, is that the human family has 
diverged further in sequence. 

The rodent KLK families are very large with 13 genes in the 
rat and 24 genes in the mouse (23-25) and both cluster to one 
chromosomal loci (23, 25, 26). The original human KLK family 
(KLK1-3), also clustered to one loci (21, 22), has a high degree 
of similarity to the rodent family with 65-72% identity at the 
nucleotide level. Due to the similarity between these KLK 
families and the previous report by Murray et aL (29), which 
proposed the existence of up to 19 human KLK genes by screen- 
ing human genomic DNA with a monkey cDNA, we began the 
search for new members of the human KLK family. In addition, 
previous studies placed the genes encoding the serine proteases 
Protease M and Nesl (KLK6 and KLK 10 respectively), on chro- 
mosome 19q in a region to which the original kallikreins, 
KLK1-3, also map (10, 13). 

Our initial studies involving Southern blot analysis identi- 
fied several potentially novel serine protease genes within the 
surrounding regions of chromosome 19q. Further investigation 
of these results identified five published serine protease genes 
(previously named Protease M, Nesl, TLSP, SCCE, and Neu- 
ropsin) on the BACs and cosmids containing the human KLK 
genes, KLK2-4. All five genes were detected telomeric to KLK4 
and thus were considered to be candidates for new members of 
the human KLK family. Furthermore, several other KLK-re- 
lated sequences, KLK5, KLK9, andKLK12-14, were also phys- 
ically mapped to 19ql3.4. This confirmed the recently pub- 
lished data identifying the position of these genes on 
chromosome 19, by computer analysis (30). In addition, as the 
data obtained by Southern blot analysis with a degenerate 
probe to the histidine-encoding region of serine proteases, de- 
tected unaccounted for signals only within the two BACs adja- 
cent to the classical KLK locus (BC85745 and BC349237) and 
no signal from DNA, both telomeric to these two BACs or 
centromeric to the original KLK cluster, it would appear that 
there are only 10 additional serine protease genes within the 
surrounding regions of the classical KLK locus. Li accordance 
with our data, the next gene telomeric from KLK14 (KLK-L6) is 
an unrelated gene, OB-BP-like, and encodes a putative leptin- 
binding protein (34). Accordingly, although sequence is not yet 
available for the region centromeric to KLK1, it is likely the 
complete KLK locus has now been identified. 
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Table V 

Expression patterns of the 14 KLK genes obtained by RT-PCR analysis 
The following symbols used represent: + + +, high expression; ++, moderate expression; +, low expression; (♦) PCR products of the predicted 
size sequenced and confirmed to be the correct sequence. 
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Comparison of the rat KLK gene sequences indicates con- 
certed evolution of the family that results in sequence homog- 
enization of all family members (35). In addition, there is a 
clear conservation of genomic organization with 5 coding exons 
for all KLK genes, although some of the newer family members 
have additional exons containing 5 '-untranslated sequences (6, 
7, 10, 13, 15-21, 23, 27). Characterization of the mouse and rat 
KLK genes showed that nucleotide sequence identity exceeds 
85% and extends throughout the coding regions, introns, and 
gene flanking sequences (23, 24, 35). The first characterized 
human KLK family members, KLK1-3, also show high se- 
quence homology, at both the nucleotide and protein level (62- 
77%) (6, 7,23,32). However , in man the level of homology drops 
sharply for kallikrein genes telomeric of KLK2-3. The next 
closest gene along the locus, KLK4 (15, 21), is 34% identical to 
KLK3 (15, 21, 27). Similarly, as reported here and by others, 
the newer members of the family (KLK5-14) are less conserved 
with 25-49% similarity at the protein level, both between each 
other and with respect to KLK1-KLK4 ! (10, 13, 15-20). This 
divergence was also evident from our phylogenetic analysis. 
The degree of divergence seen within the human KLK family 
tree suggests that there are a number of subgroups within this 
species. The mouse Klk and original human KLK gene families 
were shown to lie in syntenic regions on chromosomes 7 and 
19q, respectively (21-23). Of interest, the putative mouse or- 
thologue of one of the newer human KLK genes, KLK8 (neu- 
ropsin), also maps to this region on chromosome 7 (36). 

Analysis of microsatellite markers within the KLK locus 
showed that the current order needed to be revised. Alignment 
of each of the markers with the genomic sequence for chromo- 
some 19q revealed that a number of the genes listed were in 
fact not found within the sequence available spanning the KLK 
locus. Nevertheless, those markers representative of partial 
KLK2, KLK3, KLK6, and KLK10 gene sequences were indeed 
detected within the genomic sequence. Thus, the listing of the 
microsatellite markers within the KLK locus has now been 
further clarified. 

Poly(A) + RNA dot blot analysis indicated the prostate-re- 
stricted expression of KLK2-4 and abundant expression of 
KLK6-13 in pancreas. Of interest, in an analogous fashion to 
the mouse and rat clustered Klk I KLK genes which are predom- 
inantly expressed in the salivary gland (9, 23), KLK2-4 are 
clustered together within the human locus as are KLK6-13. 
Furthermore, in man the two genes displaying high expression 
in the testis/mammary gland and brain/bone marrow/fetal 



liver, KLK5 and KLK14, respectively, are located either side of 
the cluster of pancreatic-expressed genes. As the KLK5 gene 
lies adjacent to the KLK2-4 cluster, also found in a hormone- 
dependent tissue, these four genes may also share common 
transcriptional control elements. As has been suggested for the 
rat kallikreins (9, 24, 35), this clustering of genes having com- 
mon restricted expression patterns likely indicates evolution- 
ary conservation of elements conferring tissue specificity stich 
as duplicated promoter elements or a locus control region. 
Whether KLK transcriptional control in man is conferred 
through either or both of these elements is not yet known. 
Certainly our phylogenetic studies indicate that the human 
kallikreins have diverged to a much greater extent than the 
comparably sized rodent families. This observation may indi- 
cate that duplicated promoter elements would not be expected 
to have survived the apparent evolutionary pressure to diverge 
in man or, alternatively, may account for variations in expres- 
sion patterns. Of interest, a pancreatic-specific-like sequence, 
similar to that identified in other rat pancreatic serine pro- 
teases (37, 38), has been identified in the promoter of the rat, 
mouse, and human KLK1 genes (24). In contrast, duplicated 
promoter regions have already been identified in the KLK2 and 
KLK3 genes (39, 40). In addition, although expression profiles 
of rat kallikreins show that individual genes are expressed in 
diverse tissue-specific patterns there is still high conservation 
of potential transcriptional regulatory regions (24, 25). 

Our RT-PCR analysis of the cell line expression patterns of 
KLK5-14 indicated that most of these genes display significant 
expression in keratinocytes, breast, ovarian, and endometrial 
cancer cell lines and in salivary gland tissue and a salivary 
gland cell fine. Expression of these KLKs in cancer-derived and 
transformed cell lines is not surprising, and concurs with other 
studies (10, 11, 13-21, 27, 34, 43), as proteases are well known 
to aid in malignant processes such as invasion, metastasis, and 
proliferation (41, 42). It.will be of interest to determine whether 
any of the kallikreins are up-regulated in tumors and the role 
of these proteolytic enzymes in tumor progression. 

Although the enzymic function of these newer kallikrein 
enzymes is yet to be elucidated, an indication of the tissue in 
which they will be functional may be gained from the patterns 
of expression noted above. From several in vitro biochemical 
studies, PSA and hK2 are known to degrade extracellular ma- 
trix proteins and PSA degrades the insulin-like growth factor- 
binding protein 3 to regulate the bio-availability of insulin-like 
growth factor 1; a key factor in cancer cell proliferation (8, 44). 
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The hK2 enzyme is likely the physiological activator of PSA (8) 
and hK2 can also activate urokinase plasminogen activator 
(45), another key enzyme in matrix degradation, demonstrat- 
ing that involvement in an enzymic cascade may also be an 
important function of this family of enzymes. Both PSA and 
hK2 are useful diagnostic and prognostic markers in prostate 
and breast cancer (8, 34). Thus, many of the newer members of 
the KLK family which are all expressed in a variety of cancer 
cell lines may have similar functional roles in these diseases. 
Similarly, other new KLK members, which are highly ex- 
pressed in skin and brain, have been suggested to play impor- 
tant roles in the (patho)physiology of these organs (12, 46-48). 
The high pancreatic and gastrointestinal tract expression of 
several KLK genes may also prove to be of significance since 
tissue kallikrein has already been shown to play a role in the 
(patho)physiology of these tissues (2, 49). 

In summary, we have physically mapped the KLK locus, 
providing a definitive fine map of this region at 19ql3.4 and 
have further clarified the size and expression patterns of the 
extended human KLK gene family. Our tissue mRNA expres- 
sion studies indicate KLK expression can be largely delineated 
between those having prostate restricted expression (JKLK2-4) 
and those having abundant pancreas expression (KLK6-13). 
These profiles suggest evolutionary conservation of elements 
conferring tissue specificity such as duplicated promoter ele- 
ments and locus control regions. Future studies will be directed 
at characterizing these elements and in elucidating the roles of 
k allikrein serine proteases in human biology and disease. 
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